Musical Audio Synthesis Using Autoencoding Neural Nets
Abstract
Given a suitable network topology and well-tuned hyperparameters, artificial neural networks (ANNs) can be trained to learn a mapping from low-level audio features to one or more higher-level representations. Such networks are commonly used for classification and regression. In this work we suggest repurposing autoencoding neural networks as musical audio synthesizers. We offer an interactive musical audio synthesis system that uses feedforward ANNs for synthesis rather than for discriminative or regression tasks. In our system an ANN is trained on frames of low-level features, and a high-level representation of the musical audio is learned through an autoencoding neural net. The system lets a user interact directly with the parameters of the model and generate musical audio in real time. This work therefore proposes the exploitation of neural networks for creative musical applications.
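The idea in the abstract (train an autoencoder on frames of low-level features, then drive the decoder directly to produce sound) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' design: the features are taken to be windowed magnitude spectra of synthetic sine frames, the network has a single tanh hidden layer, and the names `make_frames`, `Autoencoder`, `train`, and `synthesize` are hypothetical. Phase is simply set to zero on resynthesis, which the abstract does not address.

```python
import numpy as np


def make_frames(n_frames=256, frame_len=64, seed=0):
    """Hypothetical stand-in for low-level features: magnitude spectra of sine frames."""
    rng = np.random.default_rng(seed)
    t = np.arange(frame_len)
    freqs = rng.uniform(0.02, 0.4, size=n_frames)  # cycles per sample
    frames = np.sin(2 * np.pi * freqs[:, None] * t[None, :])
    mags = np.abs(np.fft.rfft(frames * np.hanning(frame_len), axis=1))
    return mags / mags.max()  # shape (n_frames, frame_len // 2 + 1)


class Autoencoder:
    """One-hidden-layer feedforward autoencoder (assumed topology)."""

    def __init__(self, n_in, n_hidden, seed=0):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.1, (n_hidden, n_in))
        self.b2 = np.zeros(n_in)

    def encode(self, x):
        return np.tanh(x @ self.W1 + self.b1)

    def decode(self, h):
        return h @ self.W2 + self.b2

    def train_step(self, x, lr):
        """One full-batch gradient step on mean squared reconstruction error."""
        h = self.encode(x)
        err = self.decode(h) - x
        loss = float(np.mean(err ** 2))  # loss before this update
        g_y = 2.0 * err / err.size       # d(loss)/d(output)
        g_a = (g_y @ self.W2.T) * (1.0 - h ** 2)  # back through tanh
        self.W2 -= lr * (h.T @ g_y)
        self.b2 -= lr * g_y.sum(axis=0)
        self.W1 -= lr * (x.T @ g_a)
        self.b1 -= lr * g_a.sum(axis=0)
        return loss


def train(model, frames, epochs=300, lr=0.5):
    """Return the per-epoch reconstruction losses."""
    return [model.train_step(frames, lr) for _ in range(epochs)]


def synthesize(model, latent, frame_len=64):
    """Decode a user-chosen latent vector into a time-domain frame (zero phase assumed)."""
    mag = np.maximum(model.decode(np.asarray(latent, dtype=float)), 0.0)
    return np.fft.irfft(mag, n=frame_len)


if __name__ == "__main__":
    frames = make_frames()
    model = Autoencoder(n_in=frames.shape[1], n_hidden=8)
    losses = train(model, frames)
    print("first loss:", losses[0], "last loss:", losses[-1])
    # "Interacting with the parameters": set hidden activations by hand.
    frame = synthesize(model, [0.5, -0.2, 0.0, 0.0, 0.3, 0.0, 0.0, -0.7])
    print("synthesized frame length:", frame.shape[0])
```

In this sketch the hidden activations play the role of the learned high-level representation: after training, the encoder is bypassed and the latent vector is set by hand, so each change to it yields a new output frame. A real-time system would additionally need overlap-add between frames and some phase strategy, neither of which is shown.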
Similar resources
A Neural Network Principal Component Synthesizer for Expressive Control of Musical Sounds
This dissertation introduces a connectionist model that maps perceptual controllers to synthesis parameters to allow for an intuitive and powerful musical control of audio synthesis. This model, or system, allows the extraction, abstraction, reproduction and transformation of relevant features of a musician's style. All the information is deduced exclusively from audio. No prior knowledge of th...
Musical Attractors: a New Method for Audio Synthesis
In this paper, we use mathematical tools developed for chaos theory and time series analysis and apply them to the analysis and resynthesis of musical instruments. In particular, we can embed a basic one-dimensional audio signal time series within a higher-dimensional space to uncover the underlying generative attractor. Röbel [7],[8] described a neural-net model for audio sound synthesis based...
Feature for Musical Pitch Estimation from Simplified Auditory Model
A simplified auditory model has been used to calculate an enhanced summary autocorrelation function (ESACF), which can be used as a tool for musical pitch estimation from an audio signal. The model is computationally efficient, and its ESACF gives good results for single pitch estimation. However, using this ESACF for multiple pitch estimation seems to be very difficult to analyse...
Synthesizing Audio for Hindi WordNet
In this paper, we describe our work on the creation of a voice model using a speech synthesis system for the Hindi language. We use preexisting “voices” and publicly available speech corpora to create a “voice” using the Festival Speech Synthesis System (Black, 1997). Our contribution is two-fold: (1) We scrutinize multiple speech synthesis systems and provide an extensive report on the curren...
Speech synthesis using warped linear prediction and neural networks
A text-to-speech synthesis technique, based on warped linear prediction (WLP) and neural networks, is presented for high-quality individual sounding synthetic speech. Warped linear prediction is used as a speech production model with wide audio bandwidth yet with highly compressed control parameter data. An excitation codebook, inverse filtered from a target speaker’s voice, is applied to obtai...